Extracting Social Networks from Literary Text with Word Embedding Tools
نویسندگان
چکیده
In this paper a social network is extracted from a literary text. The social network shows, how frequent the characters interact and how similar their social behavior is. Two types of similarity measures are used: the first applies co-occurrence statistics, while the second exploits cosine similarity on different types of word embedding vectors. The results are evaluated by a paid micro-task crowdsourcing survey. The experiments suggest that specific types of word embeddings like word2vec are well-suited for the task at hand and the specific circumstances of literary fiction text.
منابع مشابه
Extracting Networks of People and Places from Literary Texts
We describe a method to automatically extract social networks from literary texts. Similar to those in prior research, nodes represent characters found in the texts; edges connect them to other characters with whom they interact, and also display sentences describing their interactions. Furthermore, other nodes encode places and are connected to characters who were active there. Thus, these net...
متن کاملQuote Attribution for Literary Text with Neural Networks
We propose a method for using neural networks to attribute quotes in literary texts. Since previous work has been unable to successfully solve this problem based on bag-of-words features, we study the issue of whether this is due to the limited expressiveness of such features. By re-framing the modeling of quotes and characters as based off of word vectors, we hope to demonstrate that individua...
متن کاملExploring the Betrothed Lovers
We present the ongoing activities and the first results achieved in a research project concerning the understanding of narrative in the high school. Students and teachers experimented with new ways to learn linguistic and digital skills, by using a collaborative learning environment built around the novel I Promessi Sposi. We analyzed the literary text, extracting social networks of characters ...
متن کاملUsing word embedding for bio-event extraction
Bio-event extraction is an important phase towards the goal of extracting biological networks from the scientific literature. Recent advances in word embedding make computation of word distribution more efficient and possible. In this study, we investigate methods bringing distributional characteristics of words in the text into event extraction by using the latest word embedding methods. By us...
متن کاملThe Actor-Topic Model for Extracting Social Networks in Literary Narrative
We present a generative model for conversational dialogues, namely the actortopic model (ACTM), that extend the author-topic model (Rosen-Zvi, et.al, 2004) to identify actors of given conversation in literary narratives. Thus ACTM assigns each instance of quoted speech to an appropriate character. We model dialogues in a literary text, which take place between two or more actors conversing on d...
متن کامل